智能论文笔记

Synthetic Low-Field MRI Super-Resolution Via Nested U-Net Architecture

Aryan Kalluvila , Neha Koonjoo , Danyal Bhutto , Marcio Rockenbach , Matthew S. Rosen

分类：计算机视觉

2022-11-28

Low-field (LF) MRI scanners have the power to revolutionize medical imaging by providing a portable and cheaper alternative to high-field MRI scanners. However, such scanners are usually significantly noisier and lower quality than their high-field counterparts. The aim of this paper is to improve the SNR and overall image quality of low-field MRI scans to improve diagnostic capability. To address this issue, we propose a Nested U-Net neural network architecture super-resolution algorithm that outperforms previously suggested deep learning methods with an average PSNR of 78.83 and SSIM of 0.9551. We tested our network on artificial noisy downsampled synthetic data from a major T1 weighted MRI image dataset called the T1-mix dataset. One board-certified radiologist scored 25 images on the Likert scale (1-5) assessing overall image quality, anatomical structure, and diagnostic confidence across our architecture and other published works (SR DenseNet, Generator Block, SRCNN, etc.). We also introduce a new type of loss function called natural log mean squared error (NLMSE). In conclusion, we present a more accurate deep learning method for single image super-resolution applied to synthetic low-field MRI via a Nested U-Net architecture.

translated by 谷歌翻译

Werewolf Among Us: A Multimodal Dataset for Modeling Persuasion Behaviors in Social Deduction Games

Bolin Lai , Hongxin Zhang , Miao Liu , Aryan Pariani , Fiona Ryan , Wenqi Jia , Shirley Anugrah Hayati , James M. Rehg , Diyi Yang

分类：机器学习 | 自然语言处理 | 计算机视觉

2022-12-16

Persuasion modeling is a key building block for conversational agents. Existing works in this direction are limited to analyzing textual dialogue corpus. We argue that visual signals also play an important role in understanding human persuasive behaviors. In this paper, we introduce the first multimodal dataset for modeling persuasion behaviors. Our dataset includes 199 dialogue transcriptions and videos captured in a multi-player social deduction game setting, 26,647 utterance level annotations of persuasion strategy, and game level annotations of deduction game outcomes. We provide extensive experiments to show how dialogue context and visual signals benefit persuasion strategy prediction. We also explore the generalization ability of language models for persuasion modeling and the role of persuasion strategies in predicting social deduction game outcomes. Our dataset, code, and models can be found at https://persuasion-deductiongame.socialai-data.org.

translated by 谷歌翻译

Auto-labelling of Bug Report using Natural Language Processing

Avinash Patil , Aryan Jadon

分类：人工智能 | 机器学习

2022-12-13

The exercise of detecting similar bug reports in bug tracking systems is known as duplicate bug report detection. Having prior knowledge of a bug report's existence reduces efforts put into debugging problems and identifying the root cause. Rule and Query-based solutions recommend a long list of potential similar bug reports with no clear ranking. In addition, triage engineers are less motivated to spend time going through an extensive list. Consequently, this deters the use of duplicate bug report retrieval solutions. In this paper, we have proposed a solution using a combination of NLP techniques. Our approach considers unstructured and structured attributes of a bug report like summary, description and severity, impacted products, platforms, categories, etc. It uses a custom data transformer, a deep neural network, and a non-generalizing machine learning method to retrieve existing identical bug reports. We have performed numerous experiments with significant data sources containing thousands of bug reports and showcased that the proposed solution achieves a high retrieval accuracy of 70% for recall@5.

translated by 谷歌翻译

HeRoSwarm: Fully-Capable Miniature Swarm Robot Hardware Design With Open-Source ROS Support

Michael Starks , Aryan Gupta , Sanjay Sarma Oruganti Venkata , Ramviyas Parasuraman

分类：机器人

2022-11-06

Experiments using large numbers of miniature swarm robots are desirable to teach, study, and test multi-robot and swarm intelligence algorithms and their applications. To realize the full potential of a swarm robot, it should be capable of not only motion but also sensing, computing, communication, and power management modules with multiple options. Current swarm robot platforms developed for commercial and academic research purposes lack several of these critical attributes by focusing only on a few of these aspects. Therefore, in this paper, we propose the HeRoSwarm, a fully-capable swarm robot platform with open-source hardware and software support. The proposed robot hardware is a low-cost design with commercial off-the-shelf components that uniquely integrates multiple sensing, communication, and computing modalities with various power management capabilities into a tiny footprint. Moreover, our swarm robot with odometry capability with Robot Operating Systems (ROS) support is unique in its kind. This simple yet powerful swarm robot design has been extensively verified with different prototyping variants and multi-robot experimental demonstrations.

translated by 谷歌翻译

A Comprehensive Survey of Regression Based Loss Functions for Time Series Forecasting

Aryan Jadon , Avinash Patil , Shruti Jadon

分类：机器学习 | 人工智能

2022-11-05

Time Series Forecasting has been an active area of research due to its many applications ranging from network usage prediction, resource allocation, anomaly detection, and predictive maintenance. Numerous publications published in the last five years have proposed diverse sets of objective loss functions to address cases such as biased data, long-term forecasting, multicollinear features, etc. In this paper, we have summarized 14 well-known regression loss functions commonly used for time series forecasting and listed out the circumstances where their application can aid in faster and better model convergence. We have also demonstrated how certain categories of loss functions perform well across all data sets and can be considered as a baseline objective function in circumstances where the distribution of the data is unknown. Our code is available at GitHub: https://github.com/aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow.

translated by 谷歌翻译

Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

Mao Ye , Ruichen Jiang , Haoxiang Wang , Dhruv Choudhary , Xiaocong Du , Bhargav Bhushanam , Aryan Mokhtari , Arun Kejariwal , Qiang Liu

分类：机器学习 | 人工智能

2022-09-02

学习在线推荐模型的关键挑战之一是时间域移动，这会导致培训与测试数据分布之间的不匹配以及域的概括错误。为了克服，我们建议学习一个未来的梯度生成器，该生成器可以预测培训未来数据分配的梯度信息，以便可以对建议模型进行培训，就像我们能够展望其部署的未来一样。与批处理更新相比，我们的理论表明，所提出的算法达到了较小的时间域概括误差，该误差通过梯度变异项在局部遗憾中衡量。我们通过与各种代表性基线进行比较来证明经验优势。

translated by 谷歌翻译

HTML版本

Exploring Hate Speech Detection with HateXplain and BERT

Arvind Subramaniam , Aryan Mehra , Sayani Kundu

分类：自然语言处理

2022-08-09

仇恨言论以贬义的评论以多种形式针对社区，并使人类退后一步。 Hatexplain是最近出版的第一个数据集，用于以理由的形式使用带注释的跨度，以及语音分类类别和有针对性的社区，以使分类更具人性化，可解释，准确和偏见。我们调整BERT以理由和阶级预测的形式执行此任务，并比较我们对跨精度，解释性和偏见的不同指标的性能。我们的新颖性是三倍。首先，我们尝试具有不同重要性值的合并理由类损失。其次，我们对理由的地面真相注意值进行了广泛的实验。随着保守和宽大的关注，我们比较了hatexplain模型的性能并检验我们的假设。第三，为了改善模型中的意外偏见，我们使用目标社区单词的掩盖，并注意偏见和解释性指标的改善。总体而言，我们成功地实现了模型的解释性，偏差删除和对原始BERT实施的几个增量改进。

translated by 谷歌翻译

Generalized Frank-Wolfe Algorithm for Bilevel Optimization

Ruichen Jiang , Nazanin Abolfazli , Aryan Mokhtari , Erfan Yazdandoost Hamedani

分类：机器学习 | (统计)机器学习

2022-06-17

在本文中，我们研究了一类二聚体优化问题，也称为简单的双重优化，在其中，我们将光滑的目标函数最小化，而不是另一个凸的约束优化问题的最佳解决方案集。已经开发了几种解决此类问题的迭代方法。 las，它们的收敛保证并不令人满意，因为它们要么渐近，要么渐近，要么是收敛速度缓慢且最佳的。为了解决这个问题，在本文中，我们介绍了Frank-Wolfe（FW）方法的概括，以解决考虑的问题。我们方法的主要思想是通过切割平面在局部近似低级问题的解决方案集，然后运行FW型更新以减少上层目标。当上层目标是凸面时，我们表明我们的方法需要$ {\ mathcal {o}}（\ max \ {1/\ epsilon_f，1/\ epsilon_g \}）$迭代才能找到$ \ \ \ \ \ \ epsilon_f $ - 最佳目标目标和$ \ epsilon_g $ - 最佳目标目标。此外，当高级目标是非convex时，我们的方法需要$ {\ MATHCAL {o}}（\ max \ {1/\ epsilon_f^2,1/（\ epsilon_f \ epsilon_g}）查找$（\ epsilon_f，\ epsilon_g）$ - 最佳解决方案。我们进一步证明了在“较低级别问题的老年人错误约束假设”下的更强的融合保证。据我们所知，我们的方法实现了所考虑的二聚体问题的最著名的迭代复杂性。我们还向数值实验提出了数值实验。与最先进的方法相比，展示了我们方法的出色性能。

translated by 谷歌翻译

The Power of Adaptivity in SGD: Self-Tuning Step Sizes with Unbounded Gradients and Affine Variance

Matthew Faw , Isidoros Tziotis , Constantine Caramanis , Aryan Mokhtari , Sanjay Shakkottai , Rachel Ward

分类： (统计)机器学习 | 机器学习

2022-02-11

我们研究了Adagrad-norm的收敛速率，作为自适应随机梯度方法（SGD）的典范，其中，基于观察到的随机梯度的步骤大小变化，以最大程度地减少非凸，平稳的目标。尽管它们很受欢迎，但在这种情况下，对自适应SGD的分析滞后于非自适应方法。具体而言，所有先前的作品都依赖以下假设的某个子集：（i）统一结合的梯度规范，（ii）均匀遇到的随机梯度方差（甚至噪声支持），（iii）步骤大小和随机性之间的有条件独立性坡度。在这项工作中，我们表明Adagrad-norm表现出$ \ Mathcal {O} \ left（\ frac {\ mathrm {poly} \ log（t）} {\ sqrt {\ sqrt {t}}} \ right）的订单最佳收敛率$在$ t $迭代之后，在与最佳调整的非自适应SGD（无界梯度规范和仿射噪声方差缩放）相同的假设下进行了$，而无需任何调整参数。因此，我们确定自适应梯度方法在比以前了解的更广泛的方案中表现出最佳的融合。

translated by 谷歌翻译

Bayesian Optimization over Permutation Spaces

Aryan Deshwal , Syrine Belakaria , Janardhan Rao Doppa , Dae Hyun Kim

分类：机器学习 | 人工智能

2021-12-02

优化昂贵以评估黑盒功能在包含D对象的所有排列中的输入空间是许多真实应用的重要问题。例如，在硬件设计中放置功能块以通过仿真优化性能。总体目标是最小化函数评估的数量，以找到高性能的排列。使用贝叶斯优化（BO）框架解决这个问题的关键挑战是折衷统计模型的复杂性和采集功能优化的途径。在本文中，我们提出并评估了博的两个算法（BOPS）。首先，BOPS-T采用高斯工艺（GP）代理模型与KENDALL内核和基于Thompson采样的Trocable采集功能优化方法，以选择评估的排列顺序。其次，BOPS-H采用GP代理模型与锦葵内核和启发式搜索方法，以优化预期的改进采集功能。理论上，从理论上分析BOPS-T的性能，以表明他们的遗憾增加了亚线性。我们对多种综合和现实世界基准测试的实验表明，BOPS-T和BOPS-H均优于组合空间的最先进的BO算法。为了推动未来的对这个重要问题的研究，我们为社区提供了新的资源和现实世界基准。

translated by 谷歌翻译